Separate API calls from install logic #29

Merged · 19 commits merged from feature/release-model into nf-core:master · Jan 12, 2024
Conversation

MillironX
Member

Background

This is phase two of my four-part plan to fix the gripes with this action.

This PR

Separate API calls from install logic

Merry Christmas, nf-core team! Here is a major overhaul to the action code, which will allow us to add mocks, create unit tests, and potentially even substitute caches or external APIs when the GitHub API hits a rate limit. I might want to add a few more tests before merging, but would love feedback now.
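As a rough illustration of the idea (a sketch with made-up names, not this repository's actual code): separating the API calls from the install logic means the install side only depends on an interface, so tests can hand it a static object instead of a live client.

```ts
// A minimal sketch with illustrative names only (not the PR's actual code).
interface NextflowRelease {
  tagName: string
  downloadUrl: string
}

interface ReleaseSource {
  getLatestRelease(): Promise<NextflowRelease>
}

// The install logic only sees the ReleaseSource interface...
async function resolveDownloadUrl(source: ReleaseSource): Promise<string> {
  const release = await source.getLatestRelease()
  return release.downloadUrl
}

// ...so a unit test can substitute a canned object and make zero API calls.
const staticSource: ReleaseSource = {
  getLatestRelease: async () => ({
    tagName: 'v23.10.0',
    downloadUrl: 'https://example.com/nextflow'
  })
}

// resolveDownloadUrl(staticSource) now exercises the install path offline.
```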

MillironX marked this pull request as draft on December 23, 2023 18:02
MillironX marked this pull request as ready for review on January 6, 2024 16:26
@MillironX
Member Author

Hey, @edmundmiller. Everything is ready to go, except for the filename lint rules. I can't find what these rules are, or where they come from. Could you point me in the right direction, and then I can fix that?

@mashehu
Contributor

mashehu commented Jan 9, 2024

> Hey, @edmundmiller. Everything is ready to go, except for the filename lint rules. I can't find what these rules are, or where they come from. Could you point me in the right direction, and then I can fix that?

These linting rules are quite hidden: the culprit was this one: https://github.com/github/eslint-plugin-github/blob/main/lib/configs/recommended.js#L21
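For context, the rule behind those failures appears to be `filenames/match-regex` (enforcing kebab-case file names), pulled in through `plugin:github/recommended`. A hedged sketch of how it could be relaxed in a classic `.eslintrc.js`; the exact pattern shown is an assumption, not the preset's literal value:

```js
// .eslintrc.js — illustrative only; adjust or remove the override as needed.
module.exports = {
  extends: ['plugin:github/recommended'],
  rules: {
    // Allow kebab-case file names with an optional extension-like suffix.
    'filenames/match-regex': ['error', '^[a-z0-9-]+(\\.[a-z0-9-]+)?$']
  }
}
```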

edmundmiller added this to the 2.0.0 milestone on Jan 9, 2024
Collaborator

edmundmiller left a comment


Awesome work!

I'm wondering if there's any way we can "sniff" the number of API requests in the tests? If there's not a quick resolution, let's just make a follow-up issue.

@mashehu
Contributor

mashehu commented Jan 9, 2024

You can read the rate limit in the response header under `x-ratelimit-remaining`

@MatthiasZepper
Member

> You can read the rate limit in the response header under `x-ratelimit-remaining`

Or avoid reinventing the wheel :-)
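Both suggestions amount to something like the sketch below (the owner/repo shown is only an example): read `x-ratelimit-remaining` off a response you are already making, or call the dedicated rate-limit endpoint, which does not count against the quota.

```ts
import { Octokit } from '@octokit/rest'

// Sketch: two ways to check the remaining GitHub API quota.
async function logRateLimit(octokit: Octokit): Promise<void> {
  // Option 1: inspect the headers of a request the action already makes.
  const response = await octokit.rest.repos.getLatestRelease({
    owner: 'nextflow-io',
    repo: 'nextflow'
  })
  console.log('remaining (header):', response.headers['x-ratelimit-remaining'])

  // Option 2: ask the /rate_limit endpoint directly (it is not rate limited).
  const { data } = await octokit.rest.rateLimit.get()
  console.log('remaining (endpoint):', data.resources.core.remaining)
}
```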

@edmundmiller
Collaborator

@MatthiasZepper where have you been hiding that???

But I was actually talking about sniffing it in this code base, to see how many calls we're using, and then doing a back-of-the-napkin calculation.

@MillironX
Member Author

MillironX commented Jan 10, 2024

> I'm wondering if there's any way we can "sniff" the number of API requests in the tests?

Not sure why? 😕 There are exactly ~~three~~ five API calls made by the test suite now: all other "API" tests now use static objects.

@edmundmiller
Collaborator

> Not sure why? 😕 There are exactly ~~three~~ five API calls made by the test suite now: all other "API" tests now use static objects.

To make sure we don't introduce any regressions, because it's been a huge pain during hackathons and for Sarek, and we're about to push it to the limit with the scatter CI in methylseq that I hope to add to the template.

@MatthiasZepper
Member

MatthiasZepper commented Jan 10, 2024

> @MatthiasZepper where have you been hiding that???

In plain sight, obviously. I bragged about it a lot, so often that I already felt bad about it.

Regarding scatter: just mind that billable minutes are calculated at the job level. As far as I know, the nf-core organization doesn't have to pay for the Actions' runtime, but should GitHub ever change its pricing model, we need to be careful, as splitting everything into nano-jobs would have a big impact then.

@MillironX
Member Author

> To make sure we don't introduce any regressions

Regressions in what, exactly? You're worried that the action today uses, say, 5 API calls, and then we might break it by introducing a bug that makes it use, say, 10 API calls?

> because it's been a huge pain during hackathons and for Sarek...

I get that, but that's during unit testing of pipelines, not of this action, right? So in that case, figuring out the timeouts beforehand would happen exactly as we've talked about in #19, and we can now easily add timeout and cooldown abilities to OctokitWrapper.
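For reference, the kind of cooldown hook meant here could look roughly like this; `OctokitWrapper` is the PR's wrapper class, but the method and fields below are hypothetical, not the actual implementation:

```ts
import { Octokit } from '@octokit/rest'

// Hypothetical sketch of a cooldown-aware wrapper method (not the PR's code).
export class OctokitWrapper {
  constructor(private readonly octokit: Octokit) {}

  // Sleep until the rate limit resets when we have run out of calls.
  private async cooldownIfNeeded(headers: {
    [key: string]: string | number | undefined
  }): Promise<void> {
    const remaining = Number(headers['x-ratelimit-remaining'] ?? 1)
    const resetEpochMs = Number(headers['x-ratelimit-reset'] ?? 0) * 1000
    if (remaining === 0 && resetEpochMs > Date.now()) {
      await new Promise(resolve => setTimeout(resolve, resetEpochMs - Date.now()))
    }
  }

  async latestRelease(owner: string, repo: string) {
    const response = await this.octokit.rest.repos.getLatestRelease({ owner, repo })
    await this.cooldownIfNeeded(response.headers)
    return response.data
  }
}
```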

@edmundmiller
Collaborator

> Regarding scatter

Yeah I think we're going to have to cross that bridge if they change the pricing model.

I'm not following, though: you're saying if they charge per job? I think the math would come out pretty close; the tests are taking 1-2 minutes apiece.

The way it's broken up, we could shard them into 4 sets as well, instead of 20 jobs. That could be a better idea anyway because we'll start saturating the free runners quickly.

@edmundmiller
Collaborator

> Regressions in what, exactly? You're worried that the action today uses, say, 5 API calls, and then we might break it by introducing a bug that makes it use, say, 10 API calls?

Exactly!

Anyway, it was just a wish-list item. I made #30 in case we ever get the time/motivation to follow up, so it's good to merge from my side, @MillironX!
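One low-tech way to guard against exactly that regression (a sketch, not necessarily what #30 will end up doing): spy on the API-facing method in a test and assert an upper bound on how often it gets called.

```ts
import { jest, test, expect } from '@jest/globals'

test('install logic stays within its API-call budget', async () => {
  // A spy standing in for the wrapper's API-facing method (illustrative name).
  const getLatestRelease = jest.fn(async () => ({ tag_name: 'v23.10.0' }))
  const source = { getLatestRelease }

  // ...exercise the install logic against `source` here...
  await source.getLatestRelease()

  // If a change starts making more calls than expected, this fails loudly.
  expect(getLatestRelease.mock.calls.length).toBeLessThanOrEqual(5)
})
```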

@MatthiasZepper
Member

MatthiasZepper commented Jan 12, 2024

First of all, sorry @MillironX for hijacking the discussion on your PR. I am neutral and will not submit a review on this PR, since I do not even remotely understand all the considerations that went into it.

> I'm not following, though: you're saying if they charge per job? I think the math would come out pretty close; the tests are taking 1-2 minutes apiece.

I was referring to this section from the HowTo article on Token rotation from Shopify:

> Our original cost estimates included certain calculation assumptions and misconceptions that ended up not holding true in production.
>
> The first assumption was mostly due to a lack of research: the granularity of what GitHub refers to as “billable minutes”. Understanding that the granularity of a billable minute is at, well, the minute-level was one thing; the other thing was the way that the rounding and bucketing works.
>
> Billable minute calculations are not performed at an organization level, but rather at the job level. Each workflow has N underlying jobs, and each underlying job has an execution duration that is rounded to the nearest minute. These rounded nearest minutes are then summed together, even if the jobs were run in parallel, and billed to the organization. Because of this, workflows that ran 10 parallel jobs that would execute in 1 second each would end up accruing 10 billable minutes.
>
> These miscalculations ended up exploding our costs during our prototyping phase and led to an architecture refactoring to sequentially executing all downstream workflows in a single job.
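A back-of-the-napkin check of that rounding effect (assuming, as in the 1-second example above, that each job is rounded up to a whole minute before summing):

```ts
// Sketch: per-job rounding, then summing, as described in the quoted article.
const billableMinutes = (jobDurationsSeconds: number[]): number =>
  jobDurationsSeconds.reduce((sum, seconds) => sum + Math.ceil(seconds / 60), 0)

// 20 parallel jobs at ~90 s each: 20 × 2 min = 40 billable minutes.
console.log(billableMinutes(Array(20).fill(90))) // 40
// The same ~30 min of work in 4 shards of ~450 s each: 4 × 8 min = 32 billable minutes.
console.log(billableMinutes(Array(4).fill(450))) // 32
```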

MillironX merged commit b30f81e into nf-core:master on Jan 12, 2024
16 checks passed
MillironX deleted the feature/release-model branch on January 12, 2024 18:02